Inferring parts of speech for lexical mappings via the Cyc KB
نویسندگان
چکیده
We present an automatic approach to learning criteria for classifying the parts-of-speech used in lexical mappings. This will further automate our knowledge acquisition system for non-technical users. The criteria for the speech parts are based on the types of the denoted terms along with morphological and corpus-based clues. Associations among these and the parts-of-speech are learned using the lexical mappings contained in the Cyc knowledge base as training data. With over 30 speech parts to choose from, the classifier achieves good results (77.8% correct). Accurate results (93.0%) are achieved in the special case of the mass-count distinction for nouns. Comparable results are also obtained using OpenCyc (73.1% general and 88.4%
منابع مشابه
Inducing criteria for lexicalization parts of speech using the Cyc KB
We present an approach for learning part-of-speech distinctions by induction over the lexicon of the Cyc knowledge base. This produces good results (74.6%) using a decision tree that incorporates both semantic features and syntactic features. Accurate results (90.5%) are achieved for the special case of deciding whether lexical mappings should use count noun or mass noun headwords. Comparable r...
متن کاملL2 Learners’ Lexical Inferencing: Perceptual Learning Style Preferences, Strategy Use, Density of Text, and Parts of Speech as Possible Predictors
This study was intended first to categorize the L2 learners in terms of their learning style preferences and second to investigate if their learning preferences are related to lexical inferencing. Moreover, strategies used for lexical inferencing and text related issues of text density and parts of speech were studied to determine their moderating effects and the best predictors of lexical infe...
متن کاملDesign and Implementation of an Intelligent Part of Speech Generator
The aim of this paper is to report on an attempt to design and implement an intelligent system capable of generating the correct part of speech for a given sentence while the sentence is totally new to the system and not stored in any database available to the system. It follows the same steps a normal individual does to provide the correct parts of speech using a natural language processor. It...
متن کاملThe Role of Private Speech Produced by Intermediate EFL Learners in Lexical Language Related Episodes
Private speech utilization is accepted to have a critical role in the continuum of language acquisition. As a valuable device in studying learners’ talk during interaction, a language related episode (LRE) is any part of a dialogue where a student speaks about a language problem s/he comes across while completing a task. The present study investigated the role of private speech produced by Inte...
متن کاملRepresentational Interoperability of Linguistic and Collaborative Knowledge Bases
Creating a Natural Language Processing (NLP) application often requires to access lexical-semantic Knowledge Bases (KBs). Recently, Collaborative Knowledge Bases (CKBs) such as Wikipedia and Wiktionary1 have been recognized as promising lexicalsemantic KBs for NLP (Zesch et al., 2008b), complementing traditional Linguistic Knowledge Bases (LKBs). As CKBs differ significantly from LKBs concernin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004